Small But Slow World: How Network Topology and Burstiness Slow Down Spreading 
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While communication networks show the small-world property of short paths, the spreading dy- 
namics in them turns out slow. Here, the time evolution of information propagation is followed 
through communication networks by using empirical data on contact sequences and the SI model. 
Introducing null models where event sequences are appropriately shuffled, we are able to distinguish 
between the contributions of different impeding effects. The slowing down of spreading is found to 
be caused mainly by weight-topology correlations and the bursty activity patterns of individuals. 
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Most complex physical, biological and social networks 
show the small-world property, where the average short- 
est path length is strikingly short when compared to the 
network size This means that there is at least one 
short path between any two nodes, which should give rise 
to rapid transmission of influence. However, dynamic 
phenomena on networks [2j, such as spreading of pan- 
demics, electronic viruses, and information, follow their 
own pathways, which are not necessarily topologically ef- 
ficient [3]. Spreading on real small- world networks turns 
out to be surprisingly slow, e.g., new infections by a com- 
puter virus are reported years after its emergence or the 
introduction of an anti- virus [4j . Here we aim at resolving 
this puzzle. For issues such as strategies and timing of 
vaccinations, improvement of information diffusion, and 
the slow decay of prevalence of computer viruses, it is cru- 
cial to understand the role of the underlying network and 
temporal activity patterns in the dynamics of spreading. 

The dynamics of spreading is commonly studied with 
SI, SIR, or SIS models [5j on static lattices or in mean 
field, where the dynamics is defined by state changes 
of individuals between (S)usceptible, (I)nfectious, and 
(R)ecovered. These models lead to a rapid, exponential 
growth of prevalence at early stages of spreading, while 
the dynamics at later stages depend on the model and 
lattice. For the SI process, the prevalence grows until the 
whole system reachable from initial conditions is infected, 
with exponential slowing down towards the end. For the 
SIR process, competing effects set in and the spreading 
may remain local or percolate through the system while 
the SIS process has more complex dynamics. 

While these results capture some of the qualitative fea- 
tures of real- world processes, the heterogeneity of the sys- 
tems limits their applicability. First, the interactions of 
real-world systems span networks by broad distributions 
of node connections and mesoscopic features in the form 
of communities with dense internal and sparse external 
connectivity. Second, interaction intensities vary and are 
closely coupled to network topology. Third, the daily cy- 
cle and bursty character of interaction events give rise to 



important temporal inhomogeneities. 

Some aspects of these features have already been stud- 
ied. For static networks, it is known that spatial struc- 
ture has an effect on epidemics (see, e.g., @, 0]), and 
community structure slows down information diffusion 
due to trapping in dense regions There is an in- 

timate relation between inhomogeneous link weights and 
network topology in social and communication networks 



lllll2j: Links within communities are strong, while links 



between them are weak. This Granovetter-type structure 
enhances the trapping effect of the communities, leading 
to additional slowing down of spreading 12 1. 

The bursty nature of human interactions has received 
particular interest and it has turned out that the cor- 
responding activity patterns are usually non-Poissonian, 
often power-law correlated (see [Hj]). The effect of bursty 
dynamics on spreading has been approached using em- 
pirical data together with approximate analytical mod- 
els ID, 15 1 . In Ref . 14 , computer worm spreading was 
studied using email logs and the SI model, and it was 
found that the non-Poissonian inter-event time distri- 
bution leads to slow spreading in the late stages of the 
process. Slow spreading was also observed in Ref. fl5j |. 
where an Internet viral marketing experiment was car- 
ried out and modeled as a branching process in the non- 
percolating regime. It was also argued that on the con- 
trary, in the percolating regime, broad inter-event time 
distributions should give rise to faster spreading. 

In this Letter, we study the problem of spreading dy- 
namics in its full complexity, using time-stamped event 
data on human communication networks and the SI 
model. We apply proper null models on the event se- 
quences and show that spreading is slowed down due to 
simultaneous effects of structural and temporal correla- 
tions. 

For the event sequences, we have used the following 
data: a) Mobile phone data from a European operator 
(national market share ~ 20%) with ~ 325 million time- 
stamped voice call records over a period of 120 days. We 
have only retained links with bidirectional calls within 
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the largest connected component (LCC) of the aggre- 
gated call network (MCN), yielding N = 4.6 x 10 6 nodes, 
L = 9 x 10 6 links, and 306 x 10 6 calls. We define 
link weights as the number of calls between two users. 
The network is sparse (average degree (k) = 3.96) show- 
ing small world property with an average shortest path 
length of (/) = 12.31; b) Mobile call data from the Real- 
ity Mining project [18| (RM), where the LCC consists of 
59 users and 93 edges with 2293 calls over ~ 9 months; 
c) email logs [l9| forming a network with the LCC hav- 
ing 2993 nodes and 28843 edges for 202687 events over 
83 days. Here communications are directed and thus the 
nodes belong to the strongly connected component (SCC) 
where all nodes are reachable from each other, or the IN- 
or the OUT-component. 

We study the SI spreading dynamics with simulations 
using the event sequences so that an infected individual 
infects a susceptible one at time t, if there is an event be- 
tween them. For the events, we use records of the times 
and participants of calls, and the times and addresses of 
emails. Calls are one-to-one communication and enable 
bidirectional exchange of information, while emails may 
have multiple addresses and the information flow is di- 
rected. Hence for calls, if either participant is infected 
he/she infects the susceptible one, whereas for emails, 
transmission is from the sender to the recipient (s). We 
initiate simulations by infecting a randomly chosen node 
at a randomly chosen event with the spreading quantity 
(information, rumor, or virus) and set all other nodes 
susceptible. Then the spreading dynamics is simulated 
by using temporally periodic boundary conditions (i.e., 
repeating the event sequence) until the set of reachable 
nodes is exhausted. We record the prevalence, i.e., the 
fraction of infected nodes (I{t)) /N as a function of time 
averaging over 10 3 initial conditions and the time to full 
prevalence t / . For the email network, we start the spread- 
ing process from a node in IN or SCC and iterate the 
process until all nodes in SCC and OUT are infected. 

To gain insight into the effects of different correlations, 
we employ null models where the original event sequences 
are randomized. These are defined so that in each null 
model, some of the correlations are separately destroyed: 
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TABLE I: Correlations retained in different null models. D: 
daily pattern, C: community structure, W: weight-topology 
correlations, B: bursty single-edge dynamics, E: event-event 
correlations between edges. 



community structure (C), weight-topology correlations 
(W), bursty event dynamics on single links (B), and 
event-event correlations between links (E). In addition, 
the overall event frequencies follow a daily pattern (D), 
with decreased night-time activity and some day-time 
peaks (see inset in Fig. [3]) The null models are as follows, 
with the letters indicating retained correlations (TableU)): 

- DCWB (equal-weight link-sequence shuffled): Whole 
single-link event sequences are randomly exchanged be- 
tween links having the same number of events. Tempo- 
ral correlations between links are destroyed. (For large 
weights we did binning with 2-3 weight values.) 

- DCB (link-sequence shuffled): Whole single-link event 
sequences are randomly exchanged between randomly 
chosen links. Event-event and weight-topology correla- 
tions are destroyed. 

- DCW (time- shuffled): Time stamps of the whole orig- 
inal event sequence are randomly reshuffled. Temporal 
correlations are destroyed. 

- D (configuration model): The original aggregated net- 
work is rewired according to the configuration model, 
where the degree distribution of the nodes and connect- 
edness are maintained but the topology is uncorrelated. 
Then, original single-link event sequences are randomly 
placed on the links, and time shuffling as above is per- 
formed. All correlations except seasonalities like the 
daily cycle are destroyed. 

Fig. [T] displays the results for the MCN. In all cases 
the spreading is slow, with full prevalence times tf of the 
order of several hundred days. It is clear that both topo- 
logical and temporal correlations slow down the spread- 
ing. It is the fastest when all correlations except the daily 
patterns are destroyed (configuration model, D). Switch- 
ing on the community structure and associated weight- 
topology correlations (DCW) slows down the spreading 
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FIG. 1: (color online) (Left) Fraction of infected nodes 
(I(t)/N) as a function of time for the original event sequence 
(o) and null models: equal-weight link-sequence shuffled 
DCWB (0), link-sequence shuffled DCB (A), time-shuffled 
DCW (□) and configuration model D (V). Inset: (I(t)/N) for 
the early stages, illustrating differences in the times to reach 
(I(t)/N) = 20%. (Right) Distribution of full prevalence times 
P(tf) due to randomness in initial conditions. 
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FIG. 2: (color online) Spreading dynamics in the Reality Min- 
ing (left) and email networks (right), for the original event 
sequence (o) and null models: DCW (□) and DCWB (0). 
In the email network, the spreading process is directed. The 
maximum prevalence is limited to the total fraction of the 
SCC and the OUT component (~ 85%). 



strongly, as expected because of the bottleneck caused by 
weak links between communities and the broad distribu- 
tion of link weights 



12| . |17| . However, comparing this 



with the DCB null model indicates that bursty single- 
edge dynamics (B) has an even stronger slowing-down 
effect than weight-topology correlations (W) . Finally, in- 
cluding all except event-event correlations (DCWB) gives 
rise to spreading dynamics very close to the original 
event sequence (DCWBE). Here, for early times, DCWB 
spreading is slightly slower than the original one. The left 
panel inset shows quantitative differences in the times to 
20% prevalence. It also indicates that temporal correla- 
tions (E) between adjacent edges have initially a minor 
accelerating effect. This can be attributed to the easy 
reachability of the members within the community where 
the spreading begins. However, for long times, bottle- 
necks appear, and event-event correlations slow the pro- 
cess down. Note that the initial conditions have an effect 
on the duration of the process, reflected in the distribu- 
tions in the right panel of Fig. [T] (the SI process itself 
is deterministic). However, the overall shape of the dy- 
namics and the effects of correlations are consistent for 
individual runs too. 

Results for the Reality Mining mobile call network and 
for the email logs are shown in Fig. [21 with the DCW and 
DCWB null models; the outcome is qualitatively similar 
with that of MCN. However, there are certain differences. 
In the small and sparse RM network, successive calls to 
many people within a short time period by a hub give 
rise to a steep prevalence rise. Such behavior is a one-off 
event and the effect is destroyed in the null models. In the 
email network, very high-degree hubs sending frequent 
emails give rise to rapid spreading once they are reached. 
This effect is conserved in the null models. 

The daily activity pattern, i. e. variation in overall com- 
munication frequency by the hour, is retained in every 
null model that is based on randomizing the original 
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FIG. 3: (color online) Spreading dynamics as obtained from a 
Poissonian event-generating model on the aggregated MCN, 
with daily pattern (□) and without (V). Link weights were 
taken into account and the curve with the daily pattern is 
comparable with the DCW null model. Inset: the average 
daily pattern as observed for the MCN event sequence with 
binning by the hour. The continuous line is to guide the eye. 



event sequence. In [20j, it was suggested that natural 
periodicities, such as the daily cycle, are responsible for 
the fat-tailed waiting time distributions. In order to eval- 
uate the impact of the daily pattern on the spreading 
speed, we carried out simulations where the aggregated 
MCN was used as the lattice. Events were generated 
on its links by two Poisson processes that conserve link 
weights: a homogeneous Poisson process, and a process 
whose instantaneous rate follows the daily pattern as cal- 
culated from the call statistics on hourly basis (see inset 
in Fig. [3l). The SI dynamics for both cases are shown 
in Fig. |31 The difference between the two curves is neg- 
ligible, demonstrating that the daily pattern has only a 
minor impact on the spreading speed. This, together 
with the observation that temporal correlations do have 
a significant decelerating effect on spreading strongly in- 
dicates that there are important, non- Poissonian corre- 
lations in the system beside the daily type cycles. 

The non-Poissonian, bursty character of event se- 
quences is clearly demonstrated by the fat-tailed distribu- 
tion of single-link inter-event times for the MCN, as seen 
in Fig. |U In order to exclude the possibility that the 
fat tail in the inter-event time distribution is only due 
to the broad weight distribution as suggested in [20j , we 
calculated the distributions for binned weights and ob- 
tained a satisfactory scaling with the average inter-event 
time, similarly to [16]. We find that the distribution can 
be fitted by a power law with an exponent 0.7 over 3.5 
decades, followed by a fast decay. The scaling breaks 
down for small inter-event times, where a peak in the 
distribution at ~ 20 seconds is found. This peak is due 
to event correlations between links. The power law indi- 
cates the non-Poissonian, bursty character of the events. 
Both the characteristics vanish for the time-shuffled null 
model DCW, and the inter-event time is well described 
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FIG. 4: (color online) Scaled inter-event time distributions 
for the MCN. Edges were log-binned by weight and for ev- 
ery second bin the inter-event time distribution of the events 
occurring in the corresponding bin is shown, scaled by the av- 
erage inter-event time of that bin r*. Inset: scaled inter-event 
time distributions for the original (o) and for the time-shuffled 
events (□). An exponential density distribution with average 
value of 1 is shown as a light (yellow) line. 



complex communication networks. Using time-stamped 
event data and appropriately prepared null models we 
have managed to quantitatively distinguish between dif- 
ferent contributions to the slowing down of spreading. 
We have shown that the main contributions are (i) the 
community structure and its correlation with link weights 
and (ii) the inhomogeneous and bursty activity patterns 
on the links. Somewhat surprisingly, the daily pattern 
and event correlations between links seem to play only 
a minor role in the overall spreading speed. Finally, we 
believe that our null models can be generally applied to 
investigate the effects of temporal and structural corre- 
lations on dynamic processes on networks. 
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by an exponential function (see inset of Fig. [4]), i.e., the 
process is Poissonian. 

The effect of burstiness on the spreading speed can 
be easily demonstrated with the following single-link cal- 
culation. Let us denote the average time for the infec- 
tion to spread through a link (the residual waiting time) 
by (tr), and assume that one of the nodes gets infected 
at a uniformly chosen random time. Similarly to Irib- 
arren et al. [HI and Vazquez et al, [3] we calculate 
(tr) for a given inter-event time distribution P(r). For 
simplicity, we consider how the burstiness introduced by 
a continuous power-law distribution of inter-event times 
P(t) ~ r~ Q affects the average infection times when 
compared to a Poisson process. If we fix the average 
inter-event time (and thus the number of events for a long 
observation period), the ratio of average infection times 

is r (TR, power law) / (^i?,, poisson) 2(a — l)(a — 3) ^ ^ ^* 

Now r is decreasing with a, r < 1 when a > 2+y2 m 3.4, 
and r goes to infinity at a — 3. This indicates that the 
burstiness characterized by power law distributions with 
slow decay has a decelerating effect on spreading with re- 
spect to the Poisson process with the same mean. How- 
ever, if the decay is fast enough, i.e., the second moment 
of the power law distribution is smaller than that of the 
Poisson distribution, we see acceleration. This mean field 
type of reasoning has its limitations. Nevertheless it illus- 
trates the mechanisms of slowing down because of bursts: 
the residual waiting time increases because the chance for 
long waiting times after getting infected increases. 

In conclusion, we have studied the effects of differ- 
ent topological and temporal correlations on spreading in 
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